Approximation of value function of differential game with minimal cost

Abstract

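The paper is concerned with the approximation of the value function of the zero-sum differential game with the minimal cost, i.e., the differential game with the payoff functional determined by the minimization of some quantity along the trajectory, by the solutions of continuous-time stochastic games with the stopping governed by one player. Notice that the value function of the auxiliary continuous-time stochastic game is described by the Isaacs–Bellman equation with additional inequality constraints. The Isaacs–Bellman equation is a parabolic PDE for the case of the stochastic differential game, and it takes the form of a system of ODEs for the case of the continuous-time Markov game. The approximation developed in the paper is based on the concept of the stochastic guide first proposed by Krasovskii and Kotelnikova.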
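
As a reading aid, the minimal-cost payoff mentioned in the abstract can be sketched in a standard form. The symbols f, g, u, v, t_0, T and x_0 are notation assumed here for illustration; the paper's own notation and assumptions may differ.

```latex
% Assumed notation (not taken from the source page):
%   f      dynamics of the controlled system
%   u, v   controls of the first and second player
%   g      the quantity minimized along the trajectory
\dot{x}(t) = f\bigl(t, x(t), u(t), v(t)\bigr), \qquad t \in [t_0, T], \qquad x(t_0) = x_0,

J\bigl(x(\cdot)\bigr) = \min_{t \in [t_0, T]} g\bigl(t, x(t)\bigr).
```

In this form the payoff depends on the whole trajectory rather than only on its terminal point, which is consistent with the abstract's use of auxiliary games with stopping.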

Related resources

Interpersonal function of language in subtitling

Translation as a communicative process is always said to be associated with various aspects of meaning loss or gain. Subtitling as a mode of translating, due to special discoursal and textual conditions imposed upon it, is believed to be an obvious case of this loss or gain. Presenting the spoken sound track of a film in writing and synchronizing the perception of this text by the viewers with...

A Brief Survey of Parametric Value Function Approximation

Reinforcement learning is a machine learning answer to the optimal control problem. It consists in learning an optimal control policy through interactions with the system to be controlled, the quality of this policy being quantified by the so-called value function. An important subtopic of reinforcement learning is to compute an approximation of this value function when the system is too large ...

Evaluation of fat atrophy in patients with dermis fat graft of orbit by CT scan

No abstract available.

Pseudorehearsal in value function approximation

Catastrophic forgetting is of special importance in reinforcement learning, as the data distribution is generally non-stationary over time. We study and compare several pseudorehearsal approaches for Q-learning with function approximation in a pole balancing task. We have found that pseudorehearsal seems to assist learning even in such very simple problems, given proper initialization of the reh...

Maintenance Cost Analysis for Replacement Model with Perfect Minimal Repair

With the evolution of technology, the maintenance of sophisticated systems is of concern for system engineers and system designers. The maintenance cost of the system depends in general on the replacement and repair policies. The system replacement may be in a strictly periodic fashion or on a random basis depending upon the maintenance policy. At failure, the repair of the system may be perfor...

Journal

Journal title: Vestnik Udmurtskogo universiteta

Year: 2021

ISSN: 1994-9197, 2076-5959

DOI: https://doi.org/10.35634/vm210402